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Input Set: 



Output Set: 



Started: 2007-07-26 17:06:44.142 

Finished: 2007-07-26 17:06:44.618 

Elapsed: 0 hr(s) 0 min(s) 0 sec(s) 476 ms 

Total Warnings: 5 

Total Errors: 1 

No. of SeqIDs Defined: 8 
Actual SeqID Count: 8 

Error code Error Description 

W 213 Artificial or Unknown found in <213> in SEQ ID (4) 

W 213 Artificial or Unknown found in <213> in SEQ ID (5) 

W 213 Artificial or Unknown found in <213> in SEQ ID (6) 

E 330 Invalid protein , found in SEQID (6) POS (5) Invalid Protein:Asx 

W 213 Artificial or Unknown found in <213> in SEQ ID (7) 

W 213 Artificial or Unknown found in <213> in SEQ ID (8) 



SEQUENCE LISTING 



<110> Camargo, Antonio C.M. 

Hayashi, Mirian A.F. 
Portaro, Fernanda C.V. 
Guerreiro, Juliano R. 

<120> PROCESS FOR THE DETERMINATION OF THE PRIMARY STRUCTURE OF THE MESSANGER 
RNA CODING FOR THE HUMAN RECOMBINANT ENDOOLIGOPEPTIDASE A (HEOPA) 
[AF217798] 

<130> 4705-OlllPUSl 

<140> 10/538,071 
<141> 2006-05-04 

<150> BR0205000-5 
<151> 2002-12-09 

<150> BR0305688-0 
<151> 2003-12-05 

<160> 8 

<170> Patentin version 3.3 

<210> 1 
<211> 2179 

<212> DNA 

<213> Oryctolagus cuniculus 

<400> 1 

cctgacgtgt cggggaggag ccgggtgccg aggtgcgcgg agtggagctc agggctgcgg 60 

agggaagcgg agctgagcgg ctggggcggc ctggccaggc cagcggagct gaggcgtcgg 12 0 

tcagccggcg gcgaacatgc gcttttgaca cattggaggc tttcttgatc atggatggtg 180 

aagatatacc agatttttca agtttaaagg aggaaactgc ttattggaag gaactttcct 24 0 

tgaagtataa gcaaacgttc caggaagctc gggatgagct agttgaattc caggaaggaa 30 0 

gcagagaatt agaagcagag ttggaggcac aattagtaca ggctgaacaa agaaatagag 360 

acttgcaagc tgataaccaa agactgaaat atgaagtgga ggcgctaaag gagaaactgg 42 0 

aacatcagta cgcacagagc tacaagcagg tttccgtatt agaagatgat ttaagtcaga 480 

ctcgggccat taaggagcag ttgcacaagt acgtgaggga gctggagcag gccaatgatg 54 0 

acctggagcg agcaaaaagg gcaacaatag tttcactgga agactttgaa caaaggctaa 60 0 

atcaggccat agaacggaat gcatttttag aaagtgaact tgatgaaaag gaatctttgt 660 

tggtctctgt acagcggtta aaggatgaag caagagattt gaggcaagaa ctagcagtcc 72 0 

gagaaagaca acaggaagtg accagaaagt cggctcctag ttctccaact ctagactgtg 780 



aaaagatgga ttctgctgta caagcatcac tttccttgcc agctacacct gttggcaaag 84 0 

ggacagaaaa cagttttcct tccccaaaag ctataccaaa tggttttggt accagtccac 90 0 

taactccttc ggccaggata tcagcgctaa acattgtggg ggatctctta cggaaagtag 960 

gggctttaga atccaaatta gcagcttgca ggaattttgc aaaggaccaa gcatcacgga 102 0 

aatcctatat ttcagggaat gttaactgtg gggtgatgaa cagcaacggc acaaagttct 1080 

ctcgatcagg gcatacatct ttcttcgaca aaggggcagt aaatggcttt gacccagctc 114 0 

ctcctcctcc tggcctgggc tcctcgcgcc cgttgtcagc acccggtatg ctgccgctca 1200 

gtgtgtgagt gcccggcctc cggggctcct gccctcctcc aacaacccag gacacccacg 1260 

cctcacccct cggtgcctgg gcccagccct gtgcccctcc atctgcctcc ccacacggct 1320 

ggcagagggc aggctgcatg cagtggcggt gcttcggccc tgcccagccc caggactctg 1380 

cgcgatatca atactggcta ttttctcttc tcgccgtagt gccgttggtt tcacatgatt 1440 

gcacttttgt gggtcacgag gtgatacata cttgtattac ttggtcactg gatgcagaag 150 0 

tacccatttg tcatccctgc ctcatagccc ccgccctgct gtactgatag gatttagttg 1560 

tgtttaggac attgcgaatc ttctacaagt tctcccccaa tcaggttgac acataccctc 1620 

ctcctgagcc ccccgagccc cctgggcgcc ctcagtgctc acgatcatgt gtttcccggc 1680 

cctaccccca gtctgggccc gttactgcca ggagtcagga aggtcgctga gttagggaat 174 0 

attgtctgta ctctcgtttt acgtagcagt tccatccata gactgcctcc agagcagtga 1800 

aacgccatgc tgagccccct ggcaggagcc tcgtgcctgg gcacgcacgg gctgagcctc 18 60 

aggccatctc ctcctccatg tgcctcagac tcgggggagg ggtgacggcg tccgtgccag 192 0 

tgtcctgtgc atcctttgat tactctcatg ctgcatttac tgtttacatt tgttttattg 1980 

tacataggtt tgtaaacatt attgcctaag atatttgtat ataacttggg ctttgtagct 2 04 0 

tttatttatt cagaactcat atggcatgtt aatgactccc gatggtgtcc tactctgggc 2100 

agctgtatag gatcatcatg tggttaaaaa accagttccc tcaaaaaaaa tcttttaatg 2160 

tggaaacaat aaatttcac 217 9 



<210> 2 

<211> 2393 

<212> DNA 

<213> Homo sapiens 

<400> 2 

agaatggcct cggacaccca ggcagtccct gacgtgtcgg ggaggagccg ggcgcggagg 60 



tacgctgagt ggagctcggg gctgcgtagg ggagctgagc cgagcggctg ggcgggcctg 12 0 

gccgggccag cggaggggag acgtcggttg agcggcggcg aacatgcgct tttgacacat 18 0 

tggaggcttt cttgatcatg gatggtgaag atataccaga tttttcaagt ttaaaggagg 240 

aaactgctta ttggaaggaa ctttccttga agtataagca aagcttccag gaagctcggg 30 0 

atgagctagt tgaattccag gaaggaagca gagaattaga agcagagttg gaggcacaat 360 

tagtacaggc tgaacaaaga aatagagact tgcaggctga taaccaaaga ctgaaatatg 42 0 

aagtggaggc attaaaggag aagctagagc atcaatatgc acagagctat aagcaggtct 48 0 

cagtgttaga agatgattta agtcagactc gggccattaa ggagcagttg cataagtatg 540 

tgagagagct ggagcaggcc aacgacgacc tggagcgagc caaaagggca acaatagttt 60 0 

cactggaaga ctttgaacaa aggctaaacc aggccattga acgaaatgca tttttagaaa 660 

gtgaacttga tgaaaaggaa tctttgttgg tctctgtaca gaggttaaag gatgaagcaa 72 0 

gagatttaag gcaagaacta gcagttcggg aaagacaaca ggaagtaact agaaagtcgg 78 0 

ctcctagctc tccaactcta gactgtgaaa agatggactc cgccgtacaa gcatcacttt 840 

ctttgccagc tacccctgtt ggcaaaggaa cggagaacac ttttccttca ccgaaagcta 90 0 

taccaaatgg ttttggtacc agtccactaa ctccctctgc taggatatca gcactaaaca 960 

tcgtggggga tctcttacgg aaagtagggg ctttagaatc caaattagca gcttgcagga 102 0 

attttgcaaa ggaccaagca tcacgaaaat cctatatttc agggaatgtt aactgtgggg 108 0 

tgctgaatgg caatggcaca aagttctctc gatcagggca tacatctttc ttcgacaaag 114 0 

gggcagtaaa cggctttgac cccgctcctc ctcctcctgg tctgggctcc tcgcgtccat 1200 

cgtcagcgcc gggtatgctg cctctcagtg tgtgagtgcc tagcctccag gtgggggctc 1260 

ctgccctcct ccaacaaccc aggacaccca cgcctcaccc ctcggtgcct gggcccagcc 132 0 

ccgtgcccct ccgtctgcct ccgcacggct ggcagagggc aggctgcatg cagtggcggc 1380 

tactgggccc tgcccagccc cggaactctg cgcgatatca atactggcta ttttctcttc 1440 

tcgccgtagt gccgttggtt tcacatgatt gcacttttgt gggtcgcaag gtgatacata 150 0 

cgtgtattac ttggtcactg gatgcagaag tacccattca tcacacctgc cccatagccc 1560 

ccactctgct gtactgatag gatttagttg tgttttagga cattgcaaat cttctagaag 162 0 

ttctccccca aatcaggtca atgtgtgccc tcctgagctc ccacccaggc atctccagtg 1680 

ctcatgatca tgtgtccccc aactccaccc ctcacagttt gggcctgttt ctggcaaaga 174 0 



gtcaggaagg ttactgaatt agggaacatt ttctgcacct tctgatttta cttaagcagc 180 0 

taccattcca tggacttgcc tcccagagca gcacaatgcc cgtctgagcc ccacgtggca 1860 

ggagcctctg ggacggggca cacacaggcc cagcctctgt gctgtctcct cctctgtgcg 192 0 

cctcagactc ggggtgaggg aggcgggcag cctctcgcca gccttcccgt ccttcagttc 198 0 

aacgacatct ttggagtgtt tttgttttct cttccaaggg ccgtcccgtt gtgttaggaa 2040 

gggtgagtgg ctggttccag ggtgggccgg tgccagctcc ggggtggact gaacagcggc 210 0 

ggctgtccct gtgcatcctt tgattactct catgctgcat ttactgttta catttgtttt 2160 

attgtacata ggtttgtaaa cattattgcc tgagatattt gtatataact tgggctttgt 2220 

agcttttatt tattcagaac gcatacggca tgttaatgac tctgatggtg tcctcctctg 2280 

ggcagctgta taggatcatc atgtggttac aaaaaatact tccctcaaaa aaattctttt 2 34 0 

aatgtggaaa caataaattt cacagaaaaa aaaaaaaaaa aaaaaaaaaa aaa 2 393 



<210> 3 

<211> 345 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Met Asp Gly Glu Asp lie Pro Asp Phe Ser Ser Leu Lys Glu Glu Thr 
15 10 15 



Ala Tyr Trp Lys Glu Leu Ser Leu Lys Tyr Lys Gin Ser Phe Gin Glu 
20 25 30 



Ala Arg Asp Glu Leu Val Glu Phe Gin Glu Gly Ser Arg Glu Leu Glu 

35 40 45 



Ala Glu Leu Glu Ala Gin Leu Val Gin Ala Glu Gin Arg Asn Arg Asp 
50 55 60 



Leu Gin Ala Asp Asn Gin Arg Leu Lys Tyr Glu Val Glu Ala Leu Lys 
65 70 75 80 



Glu Lys Leu Glu His Gin Tyr Ala Gin Ser Tyr Lys Gin Val Ser Val 
85 90 95 



Leu Glu Asp Asp Leu Ser Gin Thr Arg Ala lie Lys Glu Gin Leu His 
100 105 110 



Lys Tyr Val Arg Glu Leu Glu Gin Ala Asn Asp Asp Leu Glu Arg Ala 
115 120 125 



Lys Arg Ala Thr lie Val Ser Leu Glu Asp Phe Glu Gin Arg Leu Asn 
130 135 140 



Gin Ala lie Glu Arg Asn Ala Phe Leu Glu Ser Glu Leu Asp Glu Lys 
145 150 155 160 



Glu Ser Leu Leu Val Ser Val Gin Arg Leu Lys Asp Glu Ala Arg Asp 
165 170 175 



Leu Arg Gin Glu Leu Ala Val Arg Glu Arg Gin Gin Glu Val Thr Arg 
180 185 190 



Lys Ser Ala Pro Ser Ser Pro Thr Leu Asp Cys Glu Lys Met Asp Ser 
195 200 205 



Ala Val Gin Ala Ser Leu Ser Leu Pro Ala Thr Pro Val Gly Lys Gly 
210 215 220 



Thr Glu Asn Thr Phe Pro Ser Pro Lys Ala lie Pro Asn Gly Phe Gly 
225 230 235 240 



Thr Ser Pro Leu Thr Pro Ser Ala Arg lie Ser Ala Leu Asn lie Val 
245 250 255 



Gly Asp Leu Leu Arg Lys Val Gly Ala Leu Glu Ser Lys Leu Ala Ala 
260 265 270 



Cys Arg Asn Phe Ala Lys Asp Gin Ala Ser Arg Lys Ser Tyr lie Ser 
275 280 285 



Gly Asn Val Asn Cys Gly Val Leu Asn Gly Asn Gly Thr Lys Phe Ser 
290 295 300 



Arg Ser Gly His Thr Ser Phe Phe Asp Lys Gly Ala Val Asn Gly Phe 
305 310 315 320 



Asp Pro Ala Pro Pro Pro Pro Gly Leu Gly Ser Ser Arg Pro Ser Ser 
325 330 335 



Ala Pro Gly Met Leu Pro Leu Ser Val 
340 345 



<210> 4 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic peptide 

<400> 4 

Gly Phe Ala Pro Phe Arg Gin 
1 5 



<210> 5 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic peptide 

<400> 5 

Arg Pro Pro Gly Phe Ser Pro Phe Arg 
1 5 



<210> 6 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic peptide 
<400> 6 

Glu Leu Tyr Glu Asx Lys Pro Arg Arg Pro Tyr lie Leu 
15 10 



<210> 7 

<211> 5 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic peptide 

<400> 7 



Tyr Gly Gly Phe Leu 



1 



5 



<210> 8 

<211> 5 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic peptide 

<400> 8 



Tyr Gly Gly Phe Met 
1 5 



